An ATR-PrimPol pathway confers tolerance to oncogenic KRAS-induced and heterochromatin-associated replication stress

Activation of the KRAS oncogene is a source of replication stress, but how this stress is generated and how it is tolerated by cancer cells remain poorly understood. Here we show that induction of KRASG12V expression in untransformed cells triggers H3K27me3 and HP1-associated chromatin compaction in an RNA transcription dependent manner, resulting in replication fork slowing and cell death. Furthermore, elevated ATR expression is necessary and sufficient for tolerance of KRASG12V-induced replication stress to expand replication stress-tolerant cells (RSTCs). PrimPol is phosphorylated at Ser255, a potential Chk1 substrate site, under KRASG12V-induced replication stress and promotes repriming to maintain fork progression and cell survival in an ATR/Chk1-dependent manner. However, ssDNA gaps are generated at heterochromatin by PrimPol-dependent repriming, leading to genomic instability. These results reveal a role of ATR-PrimPol in enabling precancerous cells to survive KRAS-induced replication stress and expand clonally with accumulation of genomic instability.

For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section.

n/a Confirmed
The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly The statistical test(s) used AND whether they are one-or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section.
A description of all covariates tested A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted Give P values as exact values whenever suitable.
For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above.

Software and code
Policy information about availability of computer code Data collection 1. Immunoblot / Gel staining / Colony staining were imaged using LAS 3000 luminescent image analyzer (Fujifilm).

March 2021
Data Policy information about availability of data All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: -Accession codes, unique identifiers, or web links for publicly available datasets -A description of any restrictions on data availability -For clinical datasets or third party data, please ensure that the statement adheres to our policy RNA-seq data from SAEC cell samples generated in this study have been deposited in the NCBI GEO database under accession code GSE223027. ChIP-seq data from SAEC cell samples used in this study have been deposited in the DDBJ data base under accession code DRA002311. Proteomics data generated in this study have been deposited in JPOST, a public proteome database certified by the ProteomeXchange Consortium, under accession code PXD043419 (Access key:2101) WGS data generated in this study have been deposited in DDBJ under accession code "PRJDB16238[https://ddbj.nig.ac.jp/resource/bioproject/PRJDB16238]". UniProt human database is available at (https://www.uniprot.org). Human genome (GRCh38) data is available at (https://gdc.cancer.gov/about-data/gdc-dataprocessing/gdc-reference-files ). The Whole Transcriptome Sequencing (WTS) data from NCCJ-cohort that support the findings of this study are not publicly available and restrictions apply to the availability of these data. Such WTS data are available through to the corresponding authors (Bunsyo Shiotani: bshiotan@ncc.go.jp) for academic non-commercial research purposes upon reasonable request, and subject to review of a project proposal that will be evaluated by PRISM data access committee, entering into an appropriate data access agreement and subject to any applicable ethical approvals. The data supporting the findings of this study are available within the Article and its supplementary information files or Source Data files. Source data are provided with this paper.

Human research participants
Policy information about studies involving human research participants and Sex and Gender in Research.

Reporting on sex and gender
Analysis were performed without considering sex and gender.

Population characteristics
Lung cancer patients Recruitment National Cancer Center BioBank.

Ethics oversight National Cancer Center Institutional review board
Note that full information on the approval of the study protocol must also be provided in the manuscript.

Field-specific reporting
Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection.

Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences
For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf

Life sciences study design
All studies must disclose on these points even when the disclosure is negative.

Sample size
No statistical method was used to predict sample size. The sample size was guided by preliminary studies we carried out, and guided by a number of previous publications using similar approaches. Sample size was kept similar amongst different conditions. Sample size for all experiments shown (electron microscopy, n>200 or 1000 in 3 independent experiments; DNA fibers, n>200 in 2 or more independent experiments for IdU/CldU ratio or tract lengths, n>200 in 2 or more independent experiments for sister fork asymmetry; micronuclei, n>200 in 3 or more independent experiments; QIBC, >3000 in 3 or more independent experiments) was chosen to obtain statistical power. When the sample size is less than indicated above, we have provided n numbers in the figure legends. For anchorage-independent growth assay, all of the stained colonies were counted in 3 or more independent experiments.
Data exclusions No data was excluded.

Replication
All experiments were biologically repeated at least twice, at most six times. Each data details were shown in figure legends.
Randomization The experiments were not randomized. We used cell lines only for all experiments. Randomization is not generally used for experiments except for DNA fiber/Immunofluorescence/PLA assay. Obtained raw data of DNA fiber / Immunofluorescence / PLA were randomly selected.

Blinding
Blinding was not relevant to this study because analyses were analyst independent.
nature portfolio | reporting summary

March 2021
Reporting for specific materials, systems and methods We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response.